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Abstract 

The dominant cutoff effects in lattice QCD with Wilson quarks are pro- 
portional to the lattice spacing a. In particular, the isovector axial current 
satisfies the PCAC relation only up to such effects. Following a suggestion of 
Symanzik, they can be cancelled by adding local 0(a) correction terms to the 
action and the axial current. We here address a number of theoretical issues 
in connection with the 0(a) improvement of lattice QCD and then show that 
chiral symmetry can be used to fix the coefficients multiplying the correction 
terms. 
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1. Introduction 



A well-known deficit of Wilson's formulation of lattice QCD is that the 
chiral symmetry of the theory is not preserved [1]. This is usually not con- 
sidered to be a fundamental problem, because there is every reason to expect 
that the symmetry is restored in the continuum limit. But one should bear in 
mind that numerical simulations of lattice QCD are limited to relatively large 
lattice spacings a. Chiral symmetry may then still be rather strongly violated 
by lattice effects. 

A clear demonstration of this has recently been given in ref. [2]. The 
tests reported there have been performed for the case of quenched QCD with 
two mass-degenerate light quarks. In the continuum limit of this theory the 
isovector axial current is expected to satisfy the PCAC relation 

d^Al = 2mP a , (1.1) 

where P a denotes the associated axial density and m the quark mass. By 
considering matrix elements of eq. (1.1) between various low-energy states 
it was found that the lattice corrections to the relation are far from being 
negligible on the accessible lattices. 

With the current simulation algorithms one needs at least a factor 32 more 
computer time to be able to reduce the lattice spacing by a factor 2 at constant 
statistical errors and physical length scales. At the same time the lattice effects 
(which are of order a in the Wilson theory) are only reduced by 50%. In view of 
this unfavourable situation it is evidently desirable to develop better strategies 
to bring the cutoff effects under control. 

A comparatively simple and theoretically attractive possiblity is to apply 
Symanzik's improvement programme [3,4], where the 0(a) cutoff effects in on- 
shell quantities (particle energies, scattering amplitudes, normalized matrix 
elements of local composite fields between particle states, etc.) are cancelled 
by adding local 0(a) counterterms to the lattice action and to the composite 
fields of interest [5-8]. In the so constructed "on-shell improved" theory the 
continuum limit is reached much faster (with a rate proportional to a 2 ), while 
the computational cost of the terms added remains small [9-12]. 

On lattices without boundaries only one counterterm, the Sheikholeslami- 
Wohlert term [6], needs to be included in the lattice action and not many 
more are required to improve the low-dimensional operators such as the axial 
current and density [2,8]. A technical difficulty however is that the coefficients 
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multiplying these counterterms are not a priori known. They are functions 
of the gauge coupling and should be chosen so that the 0(a) cutoff effects in 
on-shell quantities cancel. 

The main observation made in this paper is that the coefficient c sw mul- 
tiplying the Sheikholeslami-Wohlert term can be determined by studying the 
violations of the PCAC relation (1.1) on the lattice. Chiral symmetry restora- 
tion and 0(a) improvement thus get tied up in an interesting way. The idea 
works out both in perturbation theory and non-perturbatively, but here we 
only set up the theoretical framework and defer all detailed computations to 
two separate papers [23,24]. A short account of the method has already ap- 
peared in ref. [2]. 

The present paper also gives us the opportunity to discuss the problem of 
on-shell 0(a) improvement from a somewhat novel point of view (sects. 2 and 
3). In particular, the role played by the 0(a) counterterms proportional to the 
quark mass is clarified, and it is shown that mass-independent renormalization 
schemes must be set up with care if 0(a) improvement is to be preserved. 

The matrix elements used to study chiral symmetry restoration are con- 
structed from the QCD Schrodinger functional [13-15,17] which we introduce 
in sect. 4. For completeness we also derive the form of the 0(a) boundary 
counterterms that must be included in the action to improve the Schrodinger 
functional (sect. 5), although this is not really needed when we discuss the 
lattice corrections to the PCAC relation (1.1) in sect. 6. The paper ends with 
a few concluding remarks and a series of technical appendices. 



2. On-shell O(a) improvement revisited 

Our aim in this section is to derive the form of the 0(a) counterterms to 
the lattice action and the axial current and density that are required for on-shell 
improvement. Although the improved action has previously been obtained by 
Sheikholeslami and Wohlert [6], we think it is worthwhile to go through the 
argumentation again in a slightly different way. The extension of the discussion 
to the Schrodinger functional will then be rather easy (sect. 5). 
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2.1 Preliminaries 

In this section we choose to set up the theory on a four-dimensional hyper-cubic 
euclidean lattice with spacing a and infinite extent in all directions. Most of 
our notational conventions are collected in appendix A. The gauge group is 
taken to be SU(7V) and we assume for simplicity that there are N{ flavours 
of mass-degenerate light quarks, although it would not be very much more 
difficult to treat the case of light quarks with different masses. 

A gauge field U on the lattice is an assignment of a matrix U(x,fj,) G 
SU(./V) to every lattice point x and direction fj, = 0,1,2,3. Quark and anti- 
quark fields, ip(x) and ip(x), reside on the lattice sites and carry Dirac, colour 
and flavour indices. The (unimproved) lattice action is of the form 

S[U^,i,] = S G [U] + S F [U^,i,], (2.1) 

where Sq denotes the usual Wilson plaquette action and Sp the Wilson quark 
action. Explicitly we have 

SG[E/] = ij>{l-tf(p)} (2.2) 

9o p 

with </o being the bare gauge coupling and U(p) the parallel transporter around 
the plaquette p. The sum runs over all oriented plaquettes p on the lattice. 
To define the quark action Sp we first introduce the Wilson-Dirac operator 

^ = H7^(V* + V A1 )-aV*V A1 }, (2.3) 

which involves the gauge covariant lattice derivatives and V* defined in 
appendix A. The action then assumes the standard form, 

S F [U,^,ij] = a 4 J2H x )( D + m o)^(x), (2.4) 

X 

where mo denotes the bare quark mass. 
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2.2 Local effective theory 

Close to the continuum limit the lattice theory defined above may be described 
in terms of a local effective theory with action 



The leading term, So, is just the action of the continuum theory, while the other 
terms are to be interpreted as operator insertions in the continuum theory. In 
his analysis of the cutoff dependence of lattice field theories, Symanzik [3,4] 
defines the continuum theory using dimensional regularization, but we could 
also employ a lattice with spacing very much smaller than a to give a precise 
meaning to So and the operator insertions. 

The correction terms in the effective action are of the form 



where the lagrangians £k( x ) are linear combinations of local composite fields 
of dimension 4 + k. The dimension counting here includes the (non-negative) 
powers of the quark mass m by which some of the fields may be multiplied. 
From the list of all possible such fields only a small subset occurs in the effective 
action. First of all, since one integrates over the position x, one can use partial 
integration to reduce the number of terms that must be included. The remain- 
ing terms must be invariant under gauge transformations and U(l) xSU(TVf) 
flavour rotations. They should also respect the exact discrete symmetries of the 
lattice theory. This includes all space-time lattice symmetries and the charge 
conjugation symmetry (appendix B). 

Taking these remarks into account one finds that the order a effective 
lagrangian, Ci(x), must be a linear combination of the fields 



= S + aSi + a S 2 + . . . 



(2.5) 




(2.6) 



Ox = ipcjfj.uFfj.uip 



(2.7) 



2 = ijD fl D^ + ijb fl D^ 



(2.8) 



C>3 = mtv{F^F^} 



(2.9) 



e> 4 = m 



(2.10) 



O5 = m 2 ipip 



(2.11) 
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where F^ v denotes the gauge field strength and the gauge covariant partial 
derivative (cf. appendix A). In subsects. 2.3 and 2.4 we shall come back to this 
list of fields and show how it can be reduced to essentially one term. 

Cutoff effects originate not only from the lattice action but also from 
the local composite fields that one is interested in. So let us consider some 
local gauge invariant field (f>(x) constructed from the quark and gluon fields 
on the lattice. For simplicity we assume that (f>(x) does not mix with other 
fields under renormalization. We then expect that the connected renormalized 
ra-point correlation functions 

G n (x ll ...,x n ) = {Z 4> ) n ^{x l )...^{x n )) con (2.12) 

have a well-defined continuum limit, provided the renormalization factor is 
chosen appropriately and if all points x\, . . . , x n are kept at non-zero (physical) 
distances from one another. 

In the local effective theory the renormalized lattice field Z^<j){x) is repre- 
sented by an effective field 

<f>ef[( x ) = 4>o{ x ) + a<f>i( x ) + a 2 (f>2(x) + . . . (2.13) 

The fields (f>k(x) appearing here are linear combinations of local fields with 
the appropriate dimension and symmetry properties. To order a the lattice 
correlation functions are then given by 

G n (xi, ...,x n ) = (<f>o{xi) ■ • •<Mzn)) con 

- a J d 4 y ((j)o(xi) . . • <^o (« n )^i (y)) con 

n 

+ aJ2(M x i) ■ ■■<t>i(*k) ■ • •0o(z„)) con + 0(a 2 ), (2.14) 
fc=i 

where the expectation values on the right hand side are to be taken in the 
continuum theory with action So- The second term is the contribution of the 
order a term in the effective action. Note that the integral over y in general 
diverges at the points y = x^. A subtraction prescription must hence be 
supplied. The precise way in which this happens is unimportant, because the 
arbitrariness that one has amounts to a local operator insertion at these points, 
i.e. to a redefinition of the field (j)i(x). 
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It should also be emphasized that not all the dependence on the lattice 
spacing comes from the explicit factors of a in eq. (2.14). The other source 
of a-dependence are the fields <j)\{x) and Ci(y), which are linear combinations 
of some basis of fields. While the basis elements are independent of a, the 
coefficients need not be so, although they are expected to vary only slowly 
with a. In perturbation theory the coefficients are calculable polynomials in 
In (a). 

2.3 Cutoff dependence of on-shell amplitudes 

Eventually all on-shell quantities in QCD can be extracted from correlation 
functions of local composite fields. An important point to note is that for 
this the correlation functions are only required at non-zero physical distances. 
We would now like to show that the local effective theory can be simplified 
considerably if attention is restricted to such correlation functions. As an 
example we discuss the functions G n introduced above, but the argumentation 
is readily carried over to correlation functions involving several kinds of fields. 

Let us first consider the second term on the right hand side of eq. (2.14). 
The effective lagrangian C\(y) is a linear combination of the fields (2.7)-(2.11). 
As long as y keeps away from the points Xk, the field equations of the continuum 
theory may be applied and one concludes that certain linear combinations of 
these fields do not contribute to the correlation function. This remains true 
after integration over y up to contact terms that arise at the points x\, . . .,Xk. 
Any such contact term amounts to an operator insertion, which, in eq. (2.14), 
can be compensated by a redefinition of the field <j)\ (the dimensionalities and 
symmetries of the fields involved do not allow for any other form of the contact 
terms). Since we do not insist on any particular expression for (f>i(x), but only 
require that eq. (2.14) holds, we are hence free to apply the field equations to 
simplify the effective lagrangian. 

The linear combinations of the fields (2.7)-(2.11) that vanish are easily 
found at tree-level of perturbation theory by applying the classical field equa- 
tions. There are two such relations which allow us to eliminate O2 and O4. At 
non-zero couplings the coefficients in the vanishing linear combinations change, 
but since we are only interested in reducing the number of basis fields, it is 
enough to know that the linear relations allow one to express some of them 
through the other basis elements. This information is invariant under small 
changes of the coefficients so that, barring singular events, we may take L\ to 
be a linear combination of 0\ , O3 and O5 ■ 

Similar arguments may be used to eliminate some of the terms that con- 
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Fig. 1. Graphical representation of the products of gauge field vari- 
ables contributing to the lattice field strength tensor (2.18). Each square 
corresponds to one of the terms in eq. (2.19). 

tribute to the field (f>i(x). The only place in eq. (2.14) where (j)i(x) appears is 
the third term on the right hand side. Since all positions x\, . . .,x n are kept 
at non-zero distances from each other, no contact terms can arise when the 
field equations are applied in this correlation function. The number of basis 
fields from which (j)i(x) is constructed can thus be reduced as in the case of 
the effective lagrangian. 

2.4 Improved lattice action 

Our aim is to construct an improved lattice action by adding a suitable 0(a) 
counterterm to the Wilson action (2.1). The counterterm should be chosen such 
that the order a term in the effective action is cancelled. Since we are only 
interested in on-shell amplitudes, we may assume that the effective lagrangian 
£1 is a linear combination of the fields 0\, O3 and 0$. It is then quite obvious 
that £1 can be made to vanish by adding a counterterm of the form 

a 5 {ci^iO*) + cMx) + c 5 d 5 (x)} , (2.15) 

X 

where Ok is some lattice representation of the field Ok- 

Apart from renormalizations of the bare parameters and adjustments of 
the coefficients Ck, the discretization ambiguities that one has here are of order 
a 2 . In particular, we may choose to represent the fields tr {F^F^} and ipip by 
the Wilson plaquette field and the local scalar density that already appears in 
the Wilson quark action. The 0(a) counterterms proportional O3 and O5 then 
amount to a renormalization of the bare coupling and mass. At first sight one 
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might think that such reparametrizations are insignificant, but as explained in 
sect. 3, this is not quite true if a mass-independent renormalization scheme is 
employed. For the time being we ignore this complication and simply drop the 
contributions of O3 and O5. 

For the on-shell 0(a) improved action we thus obtain 

S impi [U, ^] = S[U, A V] + 6S[U, A V], (2-16) 
SS[U,^,ij] = a 5 J2 c ^( x )i (J ^^(x)^(x), (2.17) 

X 

where S[U, ip, ip] is the Wilson action and F^ v a lattice representation of the 
gluon field strength tensor F^ v . We adopt the standard definition 

%A x )=^-i{Q^)-Q,4 x )}, (2.18) 

Qiiv{x) = U(x, fi)U(x + afi, v)U{x + av, fi)~ 1 U(x, v)~ l 

+ U(x, v)U{x — afi + az>, fi)~ 1 U(x — afi, v)~ l U{x — afi, fi) 

+ U(x — afi, fi)~ 1 U(x — afi — az>, v)~ l U{x — afi — az>, fi)U(x — aD, v) 

+ U(x — aD, v)~ l U{x — aD, fi)U(x + afi — az>, u)U(x, /u) _1 . (2.19) 

The four terms in this equation correspond to the four plaquette loops shown 
in fig. 1. 

The 0(a) counterterm (2.17) has first been written down by Sheikholes- 
lami and Wohlert [6]. In their paper they perform field transformations in 
the functional integral of the lattice theory to argue that only this term is 
required for on-shell improvement. Our strategy here has been to achieve a 
reduction of the number of terms already at the level of the effective action. 
This treatment is simpler in our opinion, because the discussion of the form of 
the 0(a) correction terms takes place entirely in the continuum theory. 

The coefficient c sw multiplying the Sheikholeslami- Wohlert term in the 
improved action is a function of the bare coupling go and must be chosen so 
that the 0(a) cutoff effects in on-shell quantities cancel. It has been shown in 
ref. [6] that c sw = 1 to lowest order of perturbation theory. The coefficient has 
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later been worked out to one-loop order by Wohlert [7]. His results will be red- 
erived in ref. [23] and a non-perturbative determination of c sw , in the quenched 
approximation and for gauge group SU(3), will be reported in ref. [24]. 

2.5 Improved axial current and density 

From the discussion in subsect. 2.2 it is clear that not all 0(a) effects can be 
removed from the correlation functions of local composite fields by employing 
an improved action. One also has to use improved fields to achieve this. They 
are constructed in very much the same way as the improved action. 

So let us assume that (j)(x) is some given (unimproved) composite field on 
the lattice. The isovector axial current and density, 

Al(x)=^(x) 7ll j^T a i,(x), (2.20) 

F a (i) = #) T5 ir a #), (2.21) 

where r a is a Pauli matrix acting on the flavour indices of the quark field, are 
examples of such fields. We then determine the general form of the order a 
term, (f>i(x), in the expansion (2.13) of the associated effective field. Since we 
are only interested in the improvement of on-shell matrix elements, the classical 
field equations may be used to eliminate some of the basis fields contributing 
to (f>i(x) (cf. subsect. 2.3). The on-shell 0(a) improved lattice field is then 
given by 

^(x) = <j)(x) + a6<f>(x), (2.22) 

where 8<j){x) is obtained by taking the general linear combination of a lattice 
representation of the remaining basis fields. 

Taking into account the transformation behaviour of the axial current 
(2.20) under the lattice symmetries and charge conjugation, it is straightfor- 
ward to show that a list of all possible basis fields is in this case given by 

(0 6 );=^ l5 ^T a a^D^-^b u a^ l5 ^T a ij, (2.23) 

(O r )l = ^±r a 75 Z^ + ? Vr 5 ±r>, (2.24) 

= m^hVTsir ^. (2.25) 

The first of these can be related to the other two by the field equations and 
so may be dropped. The 0(a) counterterm associated with {p&) a amounts 
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to a renormalization of the axial current. Since we have not imposed any 
renormalization conditions so far, we may at this point just as well drop this 
term (the issue will be addressed again in sect. 3). 

We are then left with the second term in the list above and conclude that 

8Al(x) = c A \(dl + d,)P a (x). (2.26) 

In the case of the axial density a similar analysis shows that 

6P a (x) = 0, (2.27) 

where we have again ignored a mass-dependent renormalization factor. 

The coefficient ca depends on the gauge coupling and is to be chosen so as 
to achieve the desired improvement. In perturbation theory ca is of order g$, 
because at tree-level the local axial current (2.20) is already on-shell improved 
(up to the mass-dependent renormalization factor mentioned above). This has 
previously been noted in ref. [8]. The coefficient is worked out to one-loop 
order of perturbation theory in ref. [23] and it is also possible to compute it 
non-perturbatively, using numerical simulations [24]. 



3. Mass-independent renormalization schemes 

In this section we address the problem of the proper parametrization of 
the improved theory and clarify the role played by the order a terms appearing 
in the local effective theory that correspond to renormalizations of the bare 
parameters and fields. 

3.1 Renormalization and 0(a) improvement 

In the improved theory the renormalization conditions on the gauge coupling, 
the quark mass and the improved composite fields must be chosen with care. 
What we would like to achieve is that the correlation functions of the renormal- 
ized fields, at fixed non-zero physical distances and fixed renormalized coupling 
and mass, converge to the continuum limit with a rate proportional to a 2 . Of 
course this is only possible if the coefficients c sw , ca, etc. have been assigned 
their proper values (which we assume is the case). 
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An obvious possibility is to impose all renormalization conditions on a set 
of renormalized correlation functions defined at the same point (go,anio) in 
the bare parameter space. The renormalized amplitudes in such a scheme are 
evidently inert against transformations of the bare parameters and rescalings of 
the bare fields. Without loss the corresponding 0(a) counterterms may hence 
be dropped (as we did in sect. 2). This also means that the complete list of 
counterterms has been taken into account and that, therefore, no uncancelled 
0(a) corrections can arise. In other words, renormalization schemes of this 
type are automatically compatible with 0(a) improvement. 

A disadvantage of this procedure however is that the renormalized coupling 
and fields implicitly depend on the quark mass. Mass-independent schemes, 
where one imposes the renormalization conditions at zero quark mass, are 
intrinsically simpler and certainly better suited to discuss the scale evolution of 
the renormalized parameters [22]. The problem then is that the massive theory 
must be related to the massless theory, a link which is usually established via 
the bare parameters. As a result reparametrizations of the bare theory can no 
longer be ignored if 0(a) improvement is to be preserved. 

These remarks will become clearer below, where we consider two examples 
for illustration. We shall then set up the general mass-independent renormal- 
ization scheme respecting 0(a) improvement. Specific schemes are discussed 
later in this section. 

3.2 Naive mass-independent schemes 

In the plane of bare parameters a critical line 



is expected to exist, where the physical quark mass vanishes. Our aim is to 
parametrize the theory around this line. It is then useful to introduce the 
subtracted mass 



As an aside we remark that m c (go) depends on how precisely the physical 
quark mass is defined. Different definitions lead to values of m c (go) that differ 
by terms of order a 2 . The issue will be addressed again in subsect. 6.6 and 
also in ref. [24]. In this section order a 2 corrections are considered negligible 
and a precise definition of the critical bare mass is then not required. 

It is common to assume that the renormalized coupling </ R and the renor- 
malized mass to r in a mass-independent scheme are related to the bare param- 





TO q = TOq — TO C . 
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eters through 



(3.3) 



to r = m q Z m (gQ, a/i) 



(3.4) 



where fj, is a normalization mass and 



Z{gl,an) = 1 + (a/i)5o + ^ (2) (a^ko + • • • 



(3.5) 



for Z = Zg and Z = Z m . We now show that such schemes always lead to 
uncancelled 0(a) corrections in some renormalized amplitudes. 

At </o = the quarks decouple and their dynamics is described by the free 
Wilson quark action. According to eqs. (3.4) and (3.5) (and since to c = at 
zero coupling) the renormalized quark mass to r is equal to the bare mass too 
in this situation. We are thus supposed to take the continuum limit at fixed 
too, but as is well-known this leads to uncancelled 0(a) corrections in various 
places. A particularly obvious case is the "pole mass", 



which is equal to the energy of a free quark with zero momentum. It is possible 
to correct for this deficit by replacing eq. (3.4) through 



The pole mass to p then coincides with to r up to terms of order a 2 . 

One might hope to get away with a modification of eq. (3.4), as suggested 
above, but it turns out that eq. (3.3) cannot be valid either if 0(a) corrections 
in renormalized amplitudes are to be avoided. The argumentation is more 
difficult in this case, because the problem shows up only at one-loop order of 
perturbation theory. A relatively simple quantity to consider is the running 
coupling g 2 introduced in ref. [18]. g 2 is obtained from the QCD Schrodinger 
functional by computing the response of the functional to a change of the 
boundary values of the gauge field. We do not need to know any further 
details about the coupling here except that it is a well-defined function of the 
spatial extent L of the lattice, the lattice spacing a and the bare parameters go 
and too. 




(3.6) 




(3.7) 
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The perturbation expansion 

f=g 2 + [c^°)+c^N t ]gt + ... (3.8) 

has been worked out in ref. [18]. The 0(a) counterterms required for the 
improvement of the Schrodinger functional have been taken into account in this 
calculation (cf. sect. 5). If we insert the definition (3.7) of the renormalized 
quark mass, the result assumes the form 

c (^) = ^L\ n (L/a) + k l + 0(a 2 ), (3.9) 

c (M) = -—^ ln(L/a) + k 2 + k 3 am R + 0(a 2 ), (3.10) 
where k 2 is a function of m R L and 



k 3 = 0.012000(2). (3.11) 

We can now express g 2 as a series in the renormalized coupling g R by eliminat- 
ing the bare coupling. The renormalization factor (afi) should be chosen 
so as to cancel the logarithmic divergence, but since it may not depend on the 
quark mass, we cannot get rid of the term proportional to k 3 . We thus end 
up with an uncancelled O(a) correction and so conclude that eq. (3.3) must be 
modified to be compatible with O(a) improvement. 

3.3 Improved mass-independent schemes 

From the discussion in sect. 2 we now recall that a complete O(a) improvement 
of the theory in general requires a renormalization of the bare parameters by 
factors of the form 1 + &(<7o)ara q . We thus introduce a modified bare coupling 
and bare quark mass through 

9o = 9o (! + b g am q ) , (3.12) 

m q = TO q (1 + b m am q ) . (3.13) 

The coefficients b g and b m depend on g$ and should be chosen so as to cancel 
any remaining cutoff effects of order a. Note that the modified and ordinary 
bare coupling coincide along the critical line mo = ra c . The general mass- 
independent renormalization scheme, compatible with O(a) improvement, is 
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now given by 



(3.14) 



m R = m q Z m (gQ, a/i) 



(3.15) 



where Z g and Z m have an expansion of the form (3.5) with go replaced by (fa- 
it may be worthwhile at this point to discuss the significance of the modi- 
fied bare parameters go and fh q a little further. Our aim is to parametrize the 
bare theory in such a way that the continuum limit can be reached coherently 
for all quark masses and without 0(a) corrections. To approach the limit the 
bare parameters have to be scaled in a particular way. The basic observation 
is that the scaling required for go necessarily depends on the quark mass (if 
0(a) corrections are to be avoided), while go scales independently of the quark 
mass. A similar comment applies to the bare quark mass ra q , which must be 
scaled by a mass-dependent factor. The modified bare mass fh q , on the other 
hand, is scaled by a factor depending on a/2 only. It should be clear from these 
remarks that the coefficients b g and b m are well-determined and independent 
of the particular renormalization scheme chosen. 

So far we have been exclusively concerned with the parameter renormal- 
ization, but it is now straightforward to extend the discussion to the renormal- 
ization of multiplicatively renormalizable local fields. Suppose (f>(x) is such a 
field and let (f>i(x) be the associated improved field [eq. (2.22)]. Following the 
conventions adopted in subsect. 2.5, the 0(a) counterterm which amounts to a 
renormalization of the field by a factor of the form l-\-b(go)am q is not included 
in (f>i(x). It seems more natural to us to include this factor in the definition 



of the renormalized field. The coefficient plays a role completely analogous 
to 6 g and b m . In particular, it is independent of the renormalization condition 
chosen to fix Z^. 

In perturbation theory the 6-coefficients can be expanded according to 




(3.16) 



(3.17) 



From the discussion in subsect. 3.2 we infer that b". 



g — 



and 
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This suggests that the modified bare coupling is very nearly equal to g$, for 
all couplings of interest and quark masses am q less than say 0.1. In practice 
the difference can probably be ignored until very precise calculations become 
feasible. It should be noted, incidentally, that b g = in the quenched approxi- 
mation, because the purely gluonic observables (such as g 2 ) are automatically 
improved at non-zero quark mass if they are at zero quark mass. 

In the case of the coefficients b m , 6 a and bp only the tree-level result 

€ ] = ~h (3-19) 
&i 0) = 4 0) = 1, (3.20) 

is currently available [8,23]. The corrections associated with these coefficients 
may be non-negligible at the larger quark masses. 

3.4 Renormalization conditions 

A complete specification of a mass-independent renormalization scheme re- 
quires that we fix the finite parts of the renormalization constants Z g , Z m and 
by imposing an appropriate set of renormalization conditions. Different 
schemes are then related by transformations of the form 

9r -> dlX g (gl), to r -> m R X m (g R ), (j) R -> ^1^^) ( 3 - 21 ) 

(up to corrections of order a 2 ). 

For perturbation theory minimal subtraction is a technically attractive 
renormalization prescription. It is defined by the requirement that the expan- 
sion coefficients Z% \ Z^ and Z^ are polynomials in In (a/i) with no constant 
term, to any order / > 1 of perturbation theory. 

At the non-perturbative level mass-independent renormalization schemes 
are not as easy to define. For such a scheme to be practically useful, the renor- 
malization constants Z g , Z m and should be computable through numerical 
simulations. Since they refer to properties of the theory at zero quark mass, 
one is then confronted with the problem of simulating QCD with very light (or 
even massless) quarks. 

A scheme where this difficulty is bypassed has been described in ref. [2]. 
The proposition is to impose all renormalization conditions on a set of correla- 
tion functions derived from the QCD Schrodinger functional. In this framework 
an infrared cutoff is provided by the finite extent L of the lattice (given in phys- 
ical units) and one may then safely set the quark mass to zero without running 
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into singular quark propagators. The philosophy which goes along with this 
approach has been explained in ref. [2] and a possible choice of renormalization 
conditions is described there. 



4. Schrodinger functional 

Most of the details given in this section have previously appeared in the 
literature [13-15,17]. Our aim is not to motivate the use of the Schrodinger 
functional or to explain its basic properties. For this the reader should consult 
the papers quoted above and also refs. [20,21], where an alternative approach 
(using the temporal gauge and spectral boundary conditions for the quark 
and anti-quark fields) is discussed. Instead we would like to briefly recall the 
relevant definitions and to set up the notations that will be used in the rest of 
this paper and in refs. [23-25]. We first introduce the Schrodinger functional 
without 0(a) improvement and derive the required correction terms in sect. 5. 

4-1 Lattice geometry and fields 

In general the formulation of the theory is as in sect. 2 except that the lattice 
is now taken to be of finite extent in all directions. The possible values of 
the time coordinate xq of a lattice point x are then xq = 0, a, 2a, . . . , T. At 
fixed times the lattice is thought to be wrapped on a torus of size L 3 . In other 
words, all fields are assumed to be periodic functions of the space coordinates 
with period L. 

At the boundaries xq = and xq = T we impose Dirichlet boundary 
conditions. In the case of the gauge field the requirement is that 

U{x, k)\ Xo=0 = W(x, ft), U{x, k)\ Xo=T = W'(x, ft), (4.1) 

where W(x,k) and W'(x,k) are some externally prescribed fields. More pre- 
cisely, we choose a smooth continuum gauge field Cfc(x) and set 

W(x, k) = Vexp dtC k (x + ak - iak)| . (4.2) 

Similarly, W'(x, k) is given by another field C£(x). In eq. (4.2) the symbol V 
implies a path-ordered exponential such that the fields at the larger values of 
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t come first. Note that the temporal gauge field variables U(x,0) (which are 
defined for < xq < T) remain unconstrained. 

The dynamical degrees of freedom of the quark and anti-quark fields ip(x) 
and ip(x) reside on the lattice sites x with < xq < T. At the boundaries only 
half of the Dirac components are defined and these are fixed to some prescribed 
values Explicitly, if we introduce the projectors P± = ^(1 ± 70), the 

boundary conditions on the quark field are 

P+t%)L=o = />(x), P_t%)L=T = /(x), (4.3) 

while for the anti-quark field we require that 

#e)P_U =0 = /o(x), lp(x)P + \ X0=T = p'(x). (4.4) 

For consistency the boundary values must be such that the complementary 
components P-p, . . . , p'P- vanish. 

4.2 Action 

In the interior of the lattice the action density is given by the same expressions 
as on the infinite lattice considered previously. We however need to specify the 
precise form of the action in the vicinity of the boundaries xq = and xq = T. 
In particular, for the Wilson plaquette action we now write 

SgW] = ^ J>(p)tr{l - U(p)}, (4.5) 

#0 p 

where the sum runs over all oriented plaquettes p whose corners have time 
coordinates xq in the range < xq < T. The weight w(p) is equal to 1 for all 
p except for the spatial plaquettes at xq = and xq = T, which are given the 
weight i. 

To be able to write the quark action in an elegant form it is useful to 
extend the fields to all times xq by "padding" with zeros. In the case of the 
quark field this amounts to setting 

ij){x) = if x < or x > T, (4.6) 

and 

P-#e)U = o = P+#e)U =t = 0. (4.7) 



17 



The anti-quark field is extended similarly and the so far undefined link variables 
are set to 1. With this notational convention the Wilson-Dirac operator (2.3) 
is well-defined at all times and the quark action is again given by eq. (2.4). 

An important detail we should mention at this point is that an unconven- 
tional phase factor A^ has been included in the definition (A.13)-(A.17) of the 
covariant lattice derivatives and V*. This factor is equivalent to imposing 
the generalized periodic boundary conditions 

ij){x + Lk) = e l(>k ij(x), ~${x + Lk) =~${x)e- i6k , (4.8) 

as one easily proves by performing an abelian gauge transformation. The 
present formulation, where the fields are strictly periodic and A^ appears in 
the difference operators, is technically simpler. In any case, the angles 6^ 
parametrize a family of admissable boundary conditions and give us further 
opportunities to probe the quark dynamics. 

4-3 Functional integral and correlation functions 
The Schrodinger functional 

Z[C',p',p';C,p,p] = J D[f/]D[^]D[^]e- s [^] (4.9) 

involves an integration over all fields with the specified boundary values. Note 
that the integration measure does not depend on the boundary values of the 
fields. All dependence on the latter derives from the action. 

The expectation value of any product O of fields is given by 

(0) = \^ [B[U]B[^]B[^]Oe- s ^^\ . (4.10) 

Apart from the gauge field and the quark and anti-quark fields integrated over, 
O may involve the "boundary fields" 

C(x > = ikr C[x) = ~m' (4 - n) 

( 'M = mry <"'"" = < 4 - 12 ' 

These are perfectly meaningful in eq. (4.10) (the derivatives act on the Boltz- 
mann factor) and have the effect of inserting certain combinations of ip(x) 
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and ip(x) close to the boundaries of the lattice, together with the appropriate 
gauge field variables to ensure gauge covariance (see appendix C for the precise 
definition of the variational derivatives). 



5. O(a) improvement of the Schrodinger functional 

The reader may wish to skip this section on first reading, because the 
discussion of the PC AC relation in sect. 6 is only marginally dependent on the 
results obtained here. For a more solid understanding of our approach it is 
however useful to know that the Schrodinger functional is well-behaved in the 
continuum limit and that the improvement programme extends to this quantity 
in a straightforward manner. A further motivation to include this section is 
that we would like to prepare the ground for the computation of the running 
coupling and the running quark mass along the lines described in ref. [2]. 

In the case of the pure gauge theory, the problem of the renormalization and 
0(a) improvement of the Schrodinger functional has already been addressed 
in subsects. 2.5 and 4.5 of ref. [15]. The argumentation presented there carries 
over literally to QCD with any number of quarks. We shall, therefore, restrict 
the discussion to those aspects that we believe are new or worthwhile to be 
reconsidered. 

5.1 Continuum limit and scope of improvement 

In the following the boundary values C, C , p, . . .,p' are taken to be linear 
combinations of a finite number of plane waves with coefficients and momenta 
that are kept fixed as the lattice spacing is sent to zero. Since we assume 
periodic boundary conditions in the space directions, the momenta must be 
integer multiples of 2ir/L. 

As for the correlation functions we restrict attention to the expectation 
values of products of local composite fields (f>(x) and the Fourier components 
of the boundary quark fields ((y), ■ ■ . ,C'( Z )- We assume that the fields (f>(x) 
are inserted at non-zero (physical) distances from the boundaries and from 
each other. 

The renormalization of the Schrodinger functional is achieved by express- 
ing the bare parameters through the renormalized parameters and by scaling 
the boundary values of the quark and anti-quark field with a renormaliza- 
tion constant [15,17]. For fixed renormalized boundary values, as specified 
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above, the continuum limit can then be taken and one ends up with a well- 
defined functional of the boundary values of the gauge and quark fields. By 
differentiating with respect to the latter, the continuum limit of the correlation 
functions of the Fourier components of the boundary fields C(y), . . . ,C'( Z ) is 
obtained at the same time. Operator insertions in the interior of the space-time 
volume require additional renormalizations as in infinite volume. 

It should be noted, however, that the operator insertions and the boundary 
fields are treated quite differently. In the first case the insertions are made at 
positions that are at non-zero distances from each other and from the bound- 
aries. A discussion of short distance singularities is thus avoided, and the 
renormalizations needed are just those that are already required in on-shell 
matrix elements of the operators. The correlation functions of the boundary 
fields, on the other hand, are obtained in momentum space by differentiation 
of the Schrodinger functional. Arbitrary (finite) products of the Fourier com- 
ponents of these fields can occur. Short distance singularities associated with 
such products are automatically taken care of on the level of the Schrodinger 
functional, where they would show up in the form of divergent local boundary 
terms, constructed from the boundary values of the fields. 

Our aim in this section is to study the cutoff dependence of the correlation 
functions of the type mentioned above and to determine the required 0(a) 
counterterms. The improvement of the fields (f>(x) is as in infinite volume and 
will not be considered again. We are then left with the correlation functions of 
the boundary quark fields. As in the case of the renormalization of the theory, 
it is simpler to discuss the improvement of the Schrodinger functional itself. 
Once the latter is improved the correlation functions will be improved, too. In 
particular, no further subtractions will be needed, and the 0(a) counterterms 
S((y), • • -j^C'( z ) are a U equal to zero. 

5.2 Boundary terms and effective theory 

The approach of the Schrodinger functional to the continuum limit can be 
described by a local effective theory. Compared to our discussion in sect. 2 
of the infinite volume theory the principal difference is that we now need to 
include further terms in the effective action to account for boundary effects. 
The general form of the correction terms in eq. (2.5) then is 

Sk = J #xC k {x) + )imJ d?x{B k (x)\ Xo=t +B' k (x)\ Xo=T _ t }, (5.1) 

where B k {x) and Bh(x) are linear combinations of local composite fields of 
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dimension 3 + k. A technical remark we should make here is that the limit 
e — t- is non-trivial in general and requires that the coefficients in the linear 
combinations are scaled in a particular way [13,14]. This complication is not 
very important in the present context, because we are only interested in the 
general form of the boundary terms, which is determined by symmetry con- 
siderations. The relevant symmetries are all internal symmetries of the lattice 
theory and the discrete space rotations and reflections. Moreover Bk(x) and 
Bk(x) are related by a time reflection so that only one of them needs to be 
discussed. 

In subsect. 2.3 we have shown that under certain conditions the field equa- 
tions may be used to reduce the number of terms contributing to the effective 
action. The same arguments may be applied here, first to simplify the effective 
lagrangian C\(x) and then also the boundary terms B\(x) and B[(x). The limit 
e — t- is helpful at this point, because the applicability of the field equations to 
reduce the number of terms is made obvious. As for the volume term, Ci(x), 
we conclude that it can be taken to be the same as in the infinite volume theory. 
For the improved lattice action we thus write 

SimpMlp, ^] = S[U,^, V] + ss v [u,lp, V] + SS G , b [U] + ss F , b [u,lp, V], (5-2) 

T-a 

SS v [U,ip,tp] = a 5 ^2^2c sw ip(x)^a^F^(x)ip(x) 1 (5.3) 

xo = a x 

where the notation is as in subsect. 2.4 (the index "v" indicates a volume 
counterterm, while the boundary counterterms are labeled by "b"). In the 
following we deduce the form of the boundary counterterms. 
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Table 1. Improvement coefficients c B 





c t 


c (i,i) 


r (2,0) 


2 -0.166(1) [15] 


-0.0543(5) [15] 


0.0191410(1)[18] 


-0.0115(5) [19] 


3 


-0.08900(5) [16] 


0.0191410(1)[18] 





5.3 Boundary counterterms independent of the quark fields 

In the pure gauge theory any gauge invariant local composite field has dimen- 
sion greater or equal to 4. The only fields that can contribute to B\ and B[ 
are hence given by 

9 = tr {F kl F kl } , (5.4) 

Cio = tr {F ok F ok } . (5.5) 
For the associated O(a) counterterm we may take [15] 

SS G , h [U] = ^(c s - 1) J2 tr {1 - U(p s )} 

+ i( c *- 1 )E tr i 1 - f/ ^)}- ( 5 - 6 ) 

The sums here run over all oriented plaquettes p s and p t at the boundaries that 
are space-like (p s ) or time-like (p t ). The notation is otherwise as in sect. 2. 
In perturbation theory we have 

Cs , t = l + cg( /0 2 + C g ) ^ + ..., (5.7) 

1=0 



(k) 
c. 



The known expansion coefficients for N = 2 and N = 3 are listed in table 1 (the 
numbers in square brackets point to the references from which the coefficients 
have been taken). 
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5.4 Boundary counterterms depending on the quark fields 

The presence of the quark fields allows one to construct many more composite 
fields of dimension 4 with the required symmetries. A basis of such fields is 
given by 

On =^P + D ^ + ^D P_^, (5.9) 
012 = i^P-Dvi, + ^D P + ^, (5.10) 
13 =^P +lk D k ij-^b klk P_ij, (5.11) 
14 =^P- lk D k ij-^D klk P + ij, (5.12) 
15 = m~i>i>. (5.13) 
The formal field equations imply two relations, 

On + 0i3 + Ci5 = 0, (5.14) 

012 " 014 " 015 = 0, (5.15) 

so that two fields can be eliminated. A particularly simple form of the associ- 
ated boundary counterterms to the lattice action is obtained if we choose 

0n,0i4,0i5 at x = 0, (5.16) 

012,013,015 at x =T, (5.17) 

as the basis of fields. Note that we are free to choose a different basis at the 
two boundaries of the lattice. 

The counterterm corresponding to 0i5 is of a special kind, similar to the 
volume counterterms associated with 03 and 05 (cf. sect. 2). To see this we 
consider the term at xq = and choose the lattice field, representing 0i5, to 
be given by 

m q {${y)U{x, U)- l P + ij,{x) + ^{x)P_U{x, 0)t%) } y=x+a o ■ (5-18) 

Next we note that the terms in the Wilson quark action that are linear in the 
boundary values p and p are precisely proportional to the above expression. 
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The boundary counterterm can thus be compensated by rescaling p and p 
by a factor of the form 1 + b^gfyam^. There is also a term in the action 
depending quadratically on the boundary values, but since it is already of 0(a) 
the rescaling of this term produces a correction of 0(a 2 ) which is considered 
negligible in the present discussion. 

Since one anyway has to renormalize the boundary values of the quark 
and anti-quark fields, the 0(a) counterterms corresponding to are hence 
not included in the improved action. If a mass-independent renormalization 
scheme is employed, the counterterm instead appears in the definition 

Cr(x) = Z c (gQ,ap)(l + 6 c am q )C(x) (5.19) 

of the renormalized boundary field Cr( x ) (the other boundary fields are renor- 
malized similarly). 

We are thus left with altogether four boundary counterterms, two at time 
x = and two at xq = T. Their coefficients must be such that the time reversal 
invariance of the theory is preserved. A possible choice of the counterterms 
then is 

5S F , h [U, ^, V] = a 4 J2{ (c s - 1) [O s (x) + 3: (x)] 

X 

+ (ct-i)[a t (x)-a t '(x)]}, (5.20) 

where 

<5 s (x) = ±o(x) 7fc (v£ + V fc )/o(x), (5.21) 

a s '(x) = ±o'(x) 7fc (v£ + V fc )/o'(x), (5.22) 

a t (x) = {^(y)P + V^(y) + Hy)VtP-^(y)} y={ajX) , (5.23) 

a t '(x) = {^(y)P_VoV(y) +?(y)V P + t%)}, =(T _ aiX) • (5-24) 

Note that one does not refer to any "undefined" components of the quark 
and anti-quark fields in these expressions. The projectors P± and the time 
derivatives are always such that this is avoided. 
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As will be shown in ref. [23] the Schrodinger functional in the free quark 
theory with Wilson action is already 0(a) improved. The perturbation expan- 
sion of the improvement coefficients c s and c t is hence of the form 

c ajt = l + c^g 2 +c^g* + ... (5.25) 

It may seem a bit artificial to write the coefficients in eq. (5.20) in the way we 
did. The notation was chosen to emphasize the analogy with the pure gauge 
counterterm (5.6). Moreover the terms appearing in eq. (5.20) combine with 
the terms in the Wilson quark action. At xq = 0, for example, we have 

a(9t(x) = VK 2 " + a ^)' t l J ( x + a 0) 

- ~${x + a6)U(x, 0)~ 1 P + ij(x) - ~^{x)P_U{x, 0)i/>(x + a6). (5.26) 

This counterterm thus contributes to the mass term at xq = a and changes the 
weight of the time-like hopping terms at the boundary from 1 to c t . 
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6. Chiral symmetry restoration 



We now proceed to study the axial current conservation on the lattice. The 
coefficients c sw , ca, etc. are initially assumed to have their proper values so that 
the theory is on-shell 0(a) improved. Chiral symmetry in lattice QCD with 
Wilson quarks has been extensively discussed in the literature (an incomplete 
list of references is [26-29]). Here we show that 0(a) improvement leads to 
a significant reduction of the cutoff effects in the PC AC relation. This may 
conversely be taken as an "improvement condition" to fix c sw and c\. 

6.1 PC AC relation 

In a mass-independent renormalization scheme, the renormalized improved ax- 
ial current and density are given by 

(A R )l(x) = Z A (g 2 , af i)(l + b A am q ) {A^x) + SA^x)} , (6.1) 
(Pn) a (x) = Zp(g 2 ,ai2)(l + b P am q )P a (x) (6.2) 

(cf. subsect. 2.5). The renormalization constants Z A and should be fixed 
by imposing appropriate renormalization conditions. In what follows we do not 
need to know exactly how this is done. We shall however take it for granted that 
the normalization mass fj, is independent of the lattice size (this is the normal 
situation, but the remark is made here, because in the SF scheme discussed in 
ref. [2] one sets fj, = 1/L). 

We now first consider the theory in infinite volume. Since the PCAC 
relation (1.1) holds in the continuum limit, and since the improvement has 
been fully implemented, we expect that 

(i(d* + d,)(A n )l(x) O) = 2m n {(P n ) a (x) O) + 0(a 2 ), (6.3) 

for any product O of renormalized improved fields located at non-zero dis- 
tances from x and from each other. As already indicated by the notation, the 
proportionality constant m R is a renormalized quark mass which can be shown 
to be of the form (3.15) up to corrections of order a 2 . 

There is one aspect of eq. (6.3) which may appear to be trivial but which 
will be most important in what follows. The property we wish to emphasize 
is that the relation holds for any product O of fields localized in a region not 
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containing x. Of course this is just a reflection of the fact that the PCAC rela- 
tion becomes an operator identity when the theory is formulated in Minkowski 
space. Different choices of O then correspond to considering different matrix 
elements of the relation. 

It may nevertheless be useful to briefly recall the standard proof of eq. (6.3) 
in the euclidean framework. First note that the lattice correlation functions 
on both sides of the equation converge to the continuum limit with a rate 
proportional to a 2 . It is hence sufficient to establish eq. (6.3) in the continuum 
theory. To this end one performs a local infinitesimal chiral transformation on 
the quark fields integrated over in the functional integral. We can choose the 
transformation to be trivial outside a small neighbourhood R of the point x. 
The variation SSo of the action is proportional to 

/ d 4 y u a {y) {d,(A R )l(y) - 2m R (P R )°(y)} , (6.4) 

JR 

where oj a (y) is the (position dependent) transformation parameter. The prod- 
uct O is localized outside the region R and is hence not affected by the trans- 
formation. The invariance of the quark integration measure then implies 

(SSo O) = 0, (6.5) 

which reduces to eq. (6.3) after passing to the local limit. 

Exactly the same argumentation may be applied if the theory is set up 
in a finite volume with Schrodinger functional boundary conditions. We are 
thus led to conclude that eq. (6.3) must be valid in this situation too, with 
the same value of ra R , provided the point a; is at a non-zero distance from the 
space-time boundaries. At first sight it may not be totally obvious that the 
proportionality constant m R is indeed independent of the lattice sizes T and L 
(up to corrections of order a 2 ). A second derivation of this important fact is 
therefore given in appendix D. 

We conclude this subsection by noting that eq. (6.3) remains valid if O 
is taken to be a product of unimproved fields or if the 0(a) boundary coun- 
terterms SSq^ an d ^F.b are dropped. In other words, to ensure that the 
lattice corrections in eq. (6.3) are of order a 2 , it is enough to include the 
Sheikholeslami-Wohlert term in the action and to insert the renormalized im- 
proved axial current and density. 

The reason for this is best explained by considering a simple example. 
Let us assume that O is equal to some renormalized improved field <j)n(y) and 
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suppose we drop the 0(a) counterterm aS(f>(y). From the discussion of the local 
effective theory in subsect. 2.2 we then infer that eq. (6.3) receives an order a 
correction given by 



a({d ll (A R )l(x)-2m R (P R ) a (x)}<t> 1 (y)) 



(6.6) 



The correlation function in this formula is to be evaluated in the continuum 
theory. Since the PC AC relation holds exactly in this limit, the correlation 
function vanishes and we conclude that there is in fact no order a correction. 
It is important here that y keeps away from x as otherwise a non-zero contact 
term can arise which would invalidate the argument. 

6.2 Choice of boundary values 

By varying the boundary values of the fields and by considering various field 
products O, the PCAC relation can be probed in very many different ways. 
We now make some definite choices to reduce this enormous flexibility to just 
a few parameters. In doing so we are guided by the requirement of simplicity 
and by practical considerations. 

As in refs. [15,16] the boundary values C and C of the gauge field are 
taken to be constant diagonal matrices. Explicitly, we set 



C k -- 



/<t>i o 

o 4> 2 



V o o ■ 

where the angles (f> a are required to satisfy 



\ 



<f>N J 



(6.7) 



o, 



4>i < 4> 2 < . . . < 4>n, 



4>N — <f>l < 27T. 



(6.8) 



The boundary value C at xq = T is similarly given by another set of angles 
(f>' a . For such boundary values one can prove [15] that the gauge field action is 
minimized by the constant colour-electric field with field strength 



F ( 



Ok 



(C' k -C k )/T, 



F, 



kl 



0. 



(6.9) 



In many respects this field plays the role of a classical background field. 

A good feature of the chosen boundary values C and C is that the trans- 
lation invariance in the space directions is preserved. The Fourier components 
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Fig. 2. Two-dimensional sketch of the space-time manifold. C and C" 
are the boundary values of the gauge field at time and T , respectively. The 
irregular lines represent the space-time trajectories of a quark and an anti- 
quark, which are created at time by the operator (6.10) and annihilate 
each other at the point x. 

of the boundary quark fields may hence be interpreted as operators that create 
quarks and anti-quarks with definite momenta. In particular, the product 



creates a quark and an anti-quark with zero momenta at time 0. The correlation 
functions in eq. (6.3) then are proportional to the probability amplitude that 
the quark anti-quark pair propagates to the interior of the space-time volume 
and that it annihilates at the point x (see fig. 2). As usual such pictures have a 
precise meaning if the quark lines are thought to represent quark propagators 
at the current gauge field. The derivation of the corresponding Feynman rules 
from the functional integral is straightforward and will be discussed in some 
detail in ref. [23]. 

With the choices made so far the kinematical parameters left to play with 
are the lattice sizes T and L, the time xq at which the axial current is inserted, 
the angles (f> a and (f>' a characterizing the boundary values of the gauge field and 
the angles 9k appearing in the quark action (cf. appendix A). The latter could 




(6.10) 
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be traded for a shift of the spatial quark momenta by the vector (#1, 02,03)/L, 
but as already explained in sect. 4 we prefer to formulate the theory in terms 
of strictly periodic fields and to consider the angles 9k as some parameters that 
allow us to probe the quark dynamics in an interesting way. 

6.3 Tests of chiral symmetry 

The size of the error term in eq. (6.3) (with C, C and O as specified above) can 
be taken as a measure for the violation of chiral symmetry on the lattice. In the 
following lines we show how to separate the lattice effects from the universal 
terms in the equation. A strategy to calculate the coefficients c sw and c A will 
then emerge. 

We begin by introducing the bare correlation functions 

AM = -a 6 ^|K( a; )C(y)75ir»C(z)), (6.11) 

fp( X ,) = - a 6^I ( pa (a;) ^ y)75 I r a C(z)) _ {f ..l2) 



A summation over the flavour index is implied here and the 0(a) boundary 
counterterms are omitted from the improved action (cf. subsect. 6.1). A an d 
fp depend on the bare parameters go an d m o, the coefficient c sw at the chosen 
coupling, and the kinematical parameters listed in subsect. 6.2. There is no 
dependence on the spatial coordinates of x because of translation invariance. 

The PCAC relation (6.3) implies that the time derivative of f A is propor- 
tional to fp up to cutoff effects. So if we define the ratio 



m 



= \ [\{dt + d )fx{xo) + c A adodofp(x )] /fp(x ), (6.13) 

and if we again assume that the coefficients c sw etc. have their proper values, 
it is straightforward to show that 

Z A (l + b A am q ) 2 
mn = m - + 0a 6.14 

Zp (1 + OpCSTOq) 

[cf. eqs. (6.1), (6. 2)]. The ratio m may be regarded as an unrenormalized cur- 
rent quark mass, but it should be noted that m depends on the same parameters 
as /a an d fp an d additionally on c A . 

We now consider two configurations of the kinematical parameters at the 
same point (go,amo) in the bare parameter space. The simplest possibility 
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is to insert the axial current at two different times, but we might also choose 
C = C = in one case and some non-zero C and C in the other. Let toi 
and m,2 be the associated values of the unrenormalized current quark mass to 
defined above. Since to r and the renormalization factor in eq. (6.14) do not 
depend on the kinematical parameters, it follows that 



By calculating toi and m^ we thus obtain a direct check on the size of the lattice 
effects. Note that if one is interested in computing the renormalized quark 
mass to r via eq. (6.14), a non-zero value of toi — m^ implies a corresponding 
variation in to r . In other words, to r is only computable up to such systematic 
uncertainties. The cutoff effects acquire a concrete meaning in this way which 
allows us to judge whether they are of any practical importance on the lattices 
considered. 

6.4 Chiral symmetry violation at tree-level of perturbation theory 

A first impression on the size of the cutoff effects can already be gained at tree- 
level of perturbation theory. The required computations are straightforward 
and will be discussed in more detail in ref. [23]. In the limit go — > the 
gauge field is frozen to the minimal action configuration with field strength 
as given by eq. (6.9). One then has to calculate the quark propagator in this 
classical background field. The correlation functions /a an d fp are obtained 
by taking the product of two propagators with apropriately contracted indices. 
As already mentioned before, we have c sw = 1 and ca = to lowest order of 
perturbation theory. 

If we choose zero boundary values, C = C = 0, the background field 
vanishes and the quarks propagate freely. The unrenormalized current quark 
mass to can be calculated analytically in this case. It turns out that to is 
independent of the time xq and that 



as expected for free quarks [m p denotes the "pole mass", eq. (3.6)]. The size 
of the error term in this equation depends on T, L, too and the angles 9k, but 
is generally found to be small for reasonable choices of these parameters. On 
a 16 X 8 3 lattice with am^ = 0.1 and 9k = 0, for example, we have a(m — 
TO p ) = 0.0006. Uncertainties in the renormalized quark mass of this order of 
magnitude are usually considered negligible. 




(6.15) 




(6.16) 
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Fig. 3. Plot of am at tree-level of perturbation theory. All data are 
for a 16 x 8 3 lattice with arrio = 0.01 and 9k = 0. The boundary values of 
the gauge field are zero (open circles) or as given by eq. (6.17). Full circles 
and crosses correspond to c sw = and c sw = 1 respectively. 

The situation becomes more interesting in the presence of a non-zero back- 
ground field. To illustrate this we consider the theory with gauge group SU(3) 
and choose 

{(kiihih) = I (-7r,0,7r), 

(6.17) 

= | (-5tt, 2tt, 3tt) . 

As shown in fig. 3 the corresponding values of m now depend on c sw and also 
on xq. Without improvement (i.e. for c sw = 0) a very large deviation from 
the free quark value is observed. Such uncertainties in the quark mass deter- 
mination are clearly not tolerable. One might suspect that an unreasonably 
large external scale has been induced into the system by our choice of boundary 
values for the gauge field. But this is not the case, since the background field 
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has zero frequency and its strength is small in lattice units (on a 16 X 8 3 lattice 
we have \a 2 F^\ < 0.028). 

The problem disappears almost completely if we set c sw = 1. Improvement 
thus works very well. The residual cutoff effects seen in fig. 3 are of order 
a 2 . They increase significantly towards the boundary xo = T, but this is the 
expected behaviour when the distance T — xq, measured in numbers of lattice 
spacings, becomes small. In the middle of the lattice the differences between 
the calculated quark masses are again negligible. 

6.5 Strategy to compute c sw and ca 

It should be quite obvious now that the coefficients c sw and ca can be calcu- 
lated by requiring the unrenormalized quark mass m to be independent of the 
kinematical parameters (up to terms of order a 2 ). We need to consider three 
different kinematical configurations, corresponding to mass values nil, and 
TO3. The two coefficients are then to be adjusted so that ni\ = m-i +0(a 2 ) and 
m,2 = nis + 0(a 2 ). 

A difficulty which becomes apparent at this point is that c sw and ca are 
not uniquely determined by these equations unless the order a 2 corrections 
are negligible. In perturbation theory the problem is not felt, because the 
lattice spacing can be made arbitrarily small compared to the external length 
scales. The coefficients thus have a unique expansion in powers of go and the 
computational strategy described above applies straightforwardly [23]. 

At non-zero values of go one also has the dynamically generated mass 
scales such as the pion decay constant F v . In units of these scales the lattice 
spacing is a function of go so that at fixed coupling there is no way to reduce an 
error term proportional to say (aF v ) 2 . The coefficients c sw and ca are hence 
ambiguous by terms of order aF v . There is nothing fundamentally wrong with 
this. The ambiguity simply reflects the fact that 0(a) improvement is an 
asymptotic concept, valid up to higher-order corrections. 

In practice our aim is to cancel the largest contributions to the lattice ef- 
fects in the accessible range of lattice spacings. This can be achieved by requir- 
ing nil = m 2 = m 3 to hold exactly for a definite set of well-chosen kinematical 
configurations. Of course one should check that different such "improvement 
conditions" yield consistent results within small variations. Eventually one 
defines the improved theory by adopting a particular set of improvement con- 
ditions. Any residual cutoff effects then still have to be extrapolated away by 
calculating the quantities of interest on a sequence of lattices with decreasing 
lattice spacings. A non-perturbative calculation of c sw and ca along these lines 
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will be reported in ref. [24]. 

6. 6 Cutoff effects and the critical bare mass 

Although 0(a) improvement is very efficient in reducing the lattice effects, 
one should not forget that chiral symmetry remains to be an only approxi- 
mate symmetry of the lattice theory. In particular, the critical line m c (jo) is 
not unambiguously defined. This has previously been mentioned and is here 
discussed in some more detail (cf. subsect. 3.2). 

The critical bare mass is loosely defined as the value of too at which the 
physical quark mass vanishes. From chiral perturbation theory one expects 
that the pion becomes massless at this point and the critical line is hence often 
characterized in this way. Another option is to define an unrenormalized cur- 
rent quark mass to through a ratio of correlation functions, as in subsect. 6.3, 
and to search for the value of too where to vanishes [2,29]. This method is not 
restricted to a particular physical situation, because the PCAC relation (6.3) 
holds for any choice of boundary conditions, field product O and lattice sizes 
T and L. 

The problem now is that the pion mass and the quark masses that one 
extracts from the PCAC relation do not pass through zero at exactly the same 
value of too. As a result of the lattice corrections in eq. (6.3), one rather finds 
that the calculated critical bare masses vary by terms of order a 2 (without 
improvement the uncertainty would be of order a). 

In perturbation theory these ambiguities are unimportant, because the 
lattice spacing can be taken to zero at any finite order of the expansion. At the 
non-perturbative level some residual cutoff effects are always present, as in the 
case of the coefficients c sw and ca discussed above. One should also take into 
account that numerical simulations can only be performed for limited ranges 
of the external scales. Calculations of the critical bare mass may then also be 
biased by lattice effects associated with these scales. 
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7. Concluding remarks 



So far we have been mainly interested in the case of small quark masses 
and so did not pay too much attention to the coefficients multiplying the terms 
proportional to am q (such as 6a an d bp). There are a number of these coef- 
ficients and much work will be needed to get them under control. While it is 
clear how to proceed in perturbation theory, it may not be so easy to disen- 
tangle the cutoff effects proportional to am q from physical quark mass effects 
at the non-perturbative level, using numerical simulations. 

Our discussion of the improvement of the axial current and density in 
subsect. 2.5 can be readily extended to other local fields. For low-dimensional 
fields, such as the isovector vector current [2], the 0(a) counterterms have a 
simple form. One may then be able to determine the associated coefficients by 
requiring the improved fields to transform in the expected way under chiral 
rotations up to corrections of order a 2 . 

Once the improvement has been fully implemented, the chiral Ward iden- 
tities may be used to relate different renormalized fields. An interesting case 
to consider is the scalar density ipip. This field mixes with the constant field 
under renormalization. At zero quark mass the subtraction constant and also 
the multiplicative renormalization constant may now be fixed by insisting that 
the renormalized scalar density Sr should be related to the axial density (Pn) a 
through an infinitesimal (isovector) chiral transformation. The expectation 
value (Sr) then becomes an order parameter for spontaneous chiral symmetry 
breaking in large volumes. 

We would like to thank Hartmut Wittig and Ulli Wolff for a critical reading 
of a first draft of this paper. 



Appendix A 

A.l Index conventions 

Lorentz indices fj,,v, . . . are taken from the middle of the Greek alphabet and 
run from to 3. Latin indices k,l, . . . run from f to 3 and are used to label the 
components of spatial vectors. For the Dirac indices capital letters A, B, . . . 
from the beginning of the alphabet are taken. They run from f to 4. Colour 
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vectors in the fundamental representation of SU(7V) carry indices a, (3, . . . rang- 
ing from 1 to JV, while for vectors in the adjoint representation, Latin indices 
a,b, . . . running from 1 to N 2 — 1 are employed. By abuse of notation such 
indices are also used for the flavour label of the axial current and density. 

Repeated indices are always summed over unless otherwise stated and 
scalar products are taken with euclidean metric. 

A. 2 Dime matrices 

We choose a chiral representation for the Dirac matrices, where 




(A.l) 



The 2x2 matrices e„ are taken to be 



e = -1 



e k = -ia k 



(A.2) 



with Ok the Pauli matrices. It is then easy to check that 



7A = 7m 



(A.3) 



Furthermore, if we define 75 = 70717273 > we have 




(A.4) 



In particular, 75 = 7^ and 7! = 1. The hermitean matrices 



a 




(A.5) 



are explicitly given by 




(A.6) 



where is the totally anti-symmetric tensor with 6123 = 1. 
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A. 3 Gauge group 

The Lie algebra su(7V) of SU(7V) can be identified with the space of complex 
N X N matrices X a p which satisfy 

X f = -X, tr{X} = 0, (A.7) 

where denotes the adjoint matrix of X and tr{X} = X aa is the trace of 
X . We may choose a basis T a ,a = 1,2, ... , N 2 — 1, in this space such that 

tr{T a T b } = -\5 ab . (A.8) 

For N = 2, for example, the standard basis is 

T a = T —, « = 1,2, 3, (A.9) 

where r a denote the Pauli matrices. With these conventions the structure 
constants f ahc , defined through 

[T a ,T b ] = f abc T c , (A.10) 

are real and totally anti-symmetric under permutations of the indices. 

A. 4 Lattice derivatives 

Ordinary forward and backward lattice derivatives act on colour singlet func- 
tions f(x) and are defined through 

d ll f(x) = ^[f(x + afi)-f(x)], (A.ll) 

dkf(x) = ±[f(x)-f(x-afi,)], (A.12) 

where fi denotes the unit vector in direction fj,. The gauge covariant derivative 
operators, acting on a quark field ip(x), are given by 

V ^(x) = i [XfJJ (x, n)ip{x + afi) - il>(x)] , (A. 13) 

V* VK^) = — [VK 2 -) — ^ X U(x — afi, ji)~ 1 ij}{x — a/2)] . (A- 14) 
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The origin of the phase factors 



X^ = e ia9 ^ L , 0o = O, -7r<0 k <K, (A.15) 

is explained in subsect. 4.2. They depend on the spatial extent L of the lattice 
and are all equal to 1 on the infinite lattice. The left action of the lattice 
derivative operators is defined by 

~^(x)V ^ = i [4>{x + afi)U(x, - 4>{x) ] , (A. 16) 

^(a:)V* = — [VK 2 -) — i>{ x ~ a£i)U(x — a/i,^)A |U ] . (A. 17) 



A. 5 Continuum gauge fields 

An SU(A r ) gauge potential in the continuum theory is a vector field A^(x) with 
values in the Lie algebra su(A^). It may thus be written as 

A^x) = Al(x)T a (A.18) 

with real components A a {x). The associated field tensor, 

F^{x) = d^A v {x) - dyA^x) + [A4x),A u (x)], (A.19) 

may be decomposed similarly and the right and left action of the covariant 
derivative is defined by 

D^(x) = (d fl + A fl + i0 fl /L)ij(x), (A.20) 
^{x)D^ = ^{x){d^- A^-i0jL). (A.21) 

The abelian gauge field iO^/L appearing here corresponds to the phase factors 
A^ in the lattice theory [eqs. (A.13)-(A.17)]. 
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Appendix B 



Under charge conjugation the gauge field transforms according to 

U(x,fi)->U(x,fi)*. (B.l) 

The transformation law for the quark and anti-quark fields reads 

ip(x) -> C- l ^{x) T , ^(x) -> -^(x) T C, (B.2) 

where C is a 4 X 4 matrix satisfying 

j^-Cj.C- 1 . (B.3) 

If the Dirac matrices are chosen as specified in appendix A, we may take 
C = «7o72 so that C _1 = = C. 

It follows from these definitions that the Wilson action is invariant under 
charge conjugation. This is true both on the infinite lattice and for Schrodinger 
functional boundary conditions. In the latter case the transformation is applied 
to the field variables at all sites of the lattice including the boundaries xq = 
and xq = T. 



Appendix C 



Let F[p, p] be a monomial in the boundary values p(x) and p(x) of degree 
dp. The Dirac components of the boundary values are not all independent 
since 

P-p(x) = and ^(x)P+ = 0. 



The basic property of the variational derivatives, 



dt 

d_ 

di 



F[p + tu, p] 
F[p, p + tv\ 



t=0 



-i SF 
5p(x 



v x 



5F 



(C.l) 

(C.2) 
(C.3) 
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is hence not sufficient to define them uniquely. Uniqueness is achieved by 
imposing the constraints 

5F 5F 
— -P_=0 and P + — — = 0. (C.4) 

A sum over the Dirac, colour and flavour indices is implicit in eqs. (C.2),(C.3). 
As usual the definition of the variational derivatives extends to arbitrary poly- 
nomials F[p, p] by linearity. In the same way one also defines the derivatives 
with respect to the other boundary values /o'(x) and /o'(x). 



Appendix D 

In this appendix we show that the mass m R appearing in the PCAC relation 
(6.3) must be independent of T and L up to corrections of order a 2 . Since the 
error term in eq. (6.3) is also of this order, it suffices to consider the continuum 
theory and to prove that m R does not depend on T and L in this limit. 

We first fix L and discuss what happens when T is changed. Let V be 
any product of fields localized in the interior of the space-time manifold. The 
key observation is that 

Of (V)con = ~ J d 3 z {%{z)V) coni (D.l) 

where H(z) denotes the energy density and zq must be greater than the time 
coordinates occurring in the product V and less than T. It is straightforward 
to deduce this from the familiar quantum mechanical representation of the 
euclidean correlation functions [15,17]. 
We now consider the expression 

Q = {dM*)» ~ 2m R (P R ) a ( a; )} (Pn) a (y), (D.2) 

where x and y are such that < xq < yo < T. From the above and eq. (6.3) 
we then deduce that 

o = ^(Q)co„ 

= -2^((P n r(x)(P IL r(y)) con - J d 3 z(7i(z)Q) con . (D.3) 
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The last term in this equation vanishes because of the PCAC relation (6.3) 
with O replaced by T-L{z){P^) a {y). Since the two-point function of the axial 
density does not vanish for general x and y, we conclude that dm^/dT = 0. 

For this argumentation to work out we did not need to refer to any spe- 
cial property of the Schrodinger functional boundary conditions. With peri- 
odic boundary conditions in the time direction the same conclusion would be 
reached. Moreover, since the boundary conditions do not matter in the limit 
T —7- oo, one infers that to r has to be the same in both cases. 

To discuss the dependence of to r on L we are hence free to choose periodic 
boundary conditions in all directions. We may then interchange the time with 
one of the space axes and repeat the argumentation given above to deduce that 
to r must be independent of the extent of space-time in this direction, too. 
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